OFFICIAL SENSITIVE

Interactive Analysis Template

Contents

Summary

This template demonstrates how to develop and share interactive, reproducible analysis using Python and JupyterLab.

The data used in this template is for learning purposes only.

Packages

Data import

The data sources are fictional exam scores from Kaggle, ONS data on the 18-24 student population over time, and DfE data on 19+ student populations by Local Authority. The datasets can be found at Kaggle, ONS, ONS GeoPortal and GOV.UK.

Exam performance data

Source: Kaggle

The exam performance data comprises 5,000 records (rows) and 8 features (columns).

Student population data

Source: ONS

Student population by Local Authority

Data Source: GOV.UK
Boundaries Shapefile Source: ONS GeoPortal

Data processing

Convert the 18-24 student population values from comma-separated strings (for example, 1,000) to floats (1000.00) using ASCII to float (atof).
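The conversion described above might look roughly like the sketch below. It assumes the standard library's locale.atof is the atof in question. The helper name to_float and the comma-stripping fallback are illustrative additions, not taken from the template. The fallback is there because locale.atof only removes thousands separators when the active numeric locale defines one.

```python
import locale


def to_float(text):
    """Convert a string such as '1,000' to a float such as 1000.0.

    Hypothetical helper for illustration. It assumes the values use a
    comma as the thousands separator and a full stop as the decimal point.
    """
    cleaned = text.strip()
    try:
        # locale.atof strips the thousands separator of the active locale.
        return locale.atof(cleaned)
    except ValueError:
        # In the default "C" locale no separator is defined, so the comma
        # is left in place and float() rejects it; strip it by hand.
        return float(cleaned.replace(",", ""))


print(to_float("1,000"))
```

If the notebook runs under a locale that uses the comma as a decimal mark, locale.atof would read the comma differently, so setting the numeric locale explicitly before converting is the safer choice.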

Add year and month columns to 18-24 student population data.
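One possible way to derive the year and month columns is sketched below. The layout of the period label is not stated in this template, so the sketch assumes a hypothetical "period" field holding labels such as "2016 Jan". The field names and the sample row are placeholders, not real data.

```python
from datetime import datetime


def add_year_month(rows, period_key="period"):
    """Return copies of the rows with 'year' and 'month' fields added.

    Assumes each row stores its period as '<year> <month abbreviation>',
    e.g. '2016 Jan'. This label format is an assumption for illustration.
    """
    enriched = []
    for row in rows:
        parsed = datetime.strptime(row[period_key], "%Y %b")
        enriched.append({**row, "year": parsed.year, "month": parsed.month})
    return enriched


# Placeholder row; the value is not taken from the ONS dataset.
sample = [{"period": "2016 Jan", "value": 1.0}]
print(add_year_month(sample))
```

With a pandas DataFrame the same idea would usually be expressed by parsing the label column to datetimes and reading the year and month from it.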

Data exploration

Interactive analysis

Finding One

18-24 student population has almost doubled since 1992

Student numbers (aged 18-24) have almost doubled since 1992. In 2016, the student population was around 1,896,000.

Finding Three

Recommendations

References